On Item Mappings and Statistical Rules for Selecting Binary Items for Criterion-Referenced Interpretation and Bookmark Standard Settings
نویسنده
چکیده
Item mappings are widely used in educational assessment for applications such as test administration (through test form assembly and computer assisted testing) and for criterion-referenced (CR) interpretation of test scores or scale anchoring. Item mappings are also used to construct ordered item booklets in the CTB/McGraw Hill Bookmark standard setting procedure. Selection rules for mapping the items vary with the purpose of the mapping. The objective of this paper is to categorize various types of item mappings, to describe ways to assess the consequences of a given item selection rule for mapping a binary item, and to provide a general empirical Bayes framework from which specific selection rules can be derived. A comparison is made on the maximum information (MI) rules and those derived from an empirical Bayes (EB) approach. It is noted that the EB rules coincide with the MI rules if the correction for guessing formula is used to extend the EB rules for Rasch and two parameter logistic items to the EB rules for three parameter logistic items. (Contains 13 references.) (Author/SLD) Reproductions supplied by EDRS are the best that can be made from the original document. O O On Item Mappings and Statistical Rules for Selecting Binary Items for Criterion-Referenced Interpretation and Bookmark Standard Settings Huynh Huynh University of South Carolina Item mappings are widely used in educational assessment for applications such as test administration (through test form assembly and computer assisted testing, CAT) and for criterion-referenced (CR) interpretation of test scores or scale anchoring. Item mappings are also used to construct ordered item booklets in the CTB/McGrawHill Bookmark standard setting procedure. Selection rules for mapping the items vary with the purpose of the mapping. The objective of this paper is to categorize various types of item mappings, to describe ways to assess the consequences of a given item selection rule for mapping a binary item, and to provide a general empirical Bayes framework from which specific selection rules can be derived. A comparison is made on the maximum information (MI) rules and those derived from an empirical Bayes (EB) approach. It is noted that the EB rules coincide with the MI rules if the correction for guessing formula is used to extend the EB rules for Rasch and 2PL items to the EB rules 3PL items.
منابع مشابه
Marginal True-Score Measures and Reliability for Binary Items as a Function of Their IRT Parameters
This article provides analytic evaluations of population true-score measures for binary items given their item response theory (IRT) calibration. Under the assumption of normal trait distribution, the expected values of marginalized true scores, error variance, true score variance, and reliability for norm-referenced and criterion-referenced interpretations are presented as a function of the it...
متن کاملBookmark Experience 1 Running head: COGNITIVE EXPERIENCE OF BOOKMARK PARTICIPANTS The Cognitive Experience of Bookmark Standard Setting Participants
The purposes of the study were to investigate participants’ understanding of the Bookmark procedure, item selection strategies, and factors influencing judgments. Data were collected at two state standard settings. Participants completed surveys and table leaders completed think-aloud procedures following each round. Survey results indicated understanding improved from Round 1 to Round 2, and r...
متن کاملThe Bookmark Standard Setting Procedure 3 The Bookmark Standard Setting Procedure : Strengths and Weaknesses
The Bookmark standard setting procedure was developed to address the perceived problems with the most popular method for setting cutscores, the Angoff procedure (Angoff, 1971). The purpose of the present paper is to review the Bookmark procedure and then evaluate it in terms of Berk’s (1986) criteria for evaluating standard-setting methods. Finally, the strengths and weaknesses of the Bookmark ...
متن کاملReliability of True Cutting Scores for Rasch Calibrated Items
This paper provides formulas for expected true-score measures and reliability of binary items as a function of their Rasch difficulty parameters when the trait distribution is normal or logistic. With the proposed formula, one can evaluate the theoretical values of classical reliability indexes for norm-referenced and criterion-referenced interpretations without information about raw-score or t...
متن کاملReview Psychometric Parameters of the 29th Residency Test (1380) According to the Classic Test Theory (CTT)
Introduction. To select the best group, and to make a good decision, are of the most important worries of the health and medical education ministry and also all entrants in the residency test. Having and performing a reliable and good exam will reduce doubts to a great deal. Considering different scientific methods consist of (precisely review of curriculum by the designer committee, sampling o...
متن کامل